Applying the Aurora feature extraction schemes to a phoneme based recognition task
نویسندگان
چکیده
The robustness of the ETSI (European Telecommunication Standards Institute) standardized feature extraction schemes is investigated for phoneme based recognition tasks of German speech data. The recognition tasks are an isolated command word recognition and the recognition of connected digits. The motivation of this work is the easy extensibility of a whole word recognition system by allowing also the recognition of phoneme based word HMMs (Hidden Markov Models). The recognition performance has been determined for different numbers of HMM states and different numbers of Gaussians per state. It turns out that fairly high recognition rates can be achieved also for noisy data when applying the second robust ETSI frontend.
منابع مشابه
Improving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM
Improving phoneme recognition has attracted the attention of many researchers due to its applications in various fields of speech processing. Recent research achievements show that using deep neural network (DNN) in speech recognition systems significantly improves the performance of these systems. There are two phases in DNN-based phoneme recognition systems including training and testing. Mos...
متن کاملبهبود عملکرد سیستم بازشناسی گفتار پیوسته بوسیله ویژگیهای استخراج شده از مانیفولدهای گفتاری در فضای بازسازی شده فاز
The design for new feature extraction methods out of the speech signal and combination of their obtained information is one of the most effective approaches to improve the performance of automatic speech recognition (ASR) system. Recent researches have been shown that the speech signal contains nonlinear and chaotic properties, but the effects of these properties are not used in the continuous ...
متن کاملDiscriminative MLPs in HMM-based recognition of speech in cellular telephony
Deviating from the conventional Hidden Markov ModelMulti-Layer Perceptron (HMM-MLP) hybrid paradigm of using MLP for classi cation, the proposed discriminative MLP technique uses MLP as a mapping module for feature extraction for conventional HMM-based systems. The MLP is discriminatively trained on the phonetically labeled training data to generate the phoneme posterior probabilities. We achie...
متن کاملIntroducing a method for extracting features from facial images based on applying transformations to features obtained from convolutional neural networks
In pattern recognition, features are denoting some measurable characteristics of an observed phenomenon and feature extraction is the procedure of measuring these characteristics. A set of features can be expressed by a feature vector which is used as the input data of a system. An efficient feature extraction method can improve the performance of a machine learning system such as face recognit...
متن کاملPhoneme Classification Using Temporal Tracking of Speech Clusters in Spectro-temporal Domain
This article presents a new feature extraction technique based on the temporal tracking of clusters in spectro-temporal features space. In the proposed method, auditory cortical outputs were clustered. The attributes of speech clusters were extracted as secondary features. However, the shape and position of speech clusters change during the time. The clusters temporally tracked and temporal tra...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004